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The probabilistic rule that links the formalism of Quantum Mechanics (QM) to the real world was 
stated by Born in 1926. Since then, there were many attempts to derive the Born postulate as a 
theorem, Gleason's being the most prominent. The Gleason derivation, however, is generally con- 
sidered rather intricate and its physical meaning, in particular in relation with the noncontextuality 
of probability (NP), is not quite evident. More recently, we are witnessing a revival of interest in 
possible demonstrations of the Born rule, like Zurek's and Deutsch's based on the decoherence and 
on the theory of decisions, respectively. Despite an ongoing debate about the presence of hidden 
assumptions and circular reasonings, these have the merit of prompting more physically oriented 
approaches to the problem. 

Here we suggest a new proof of the Born rule based on the noncontextuality of probability. Within 
the theorem we also demonstrate the continuity of probability with respect to the amplitudes, which 
has been suggested to be a gap in Zurek's and Deutsch's approaches, and we show that NP is 
implicitly postulated also in their demonstrations. Finally, physical motivations of NP are given 
based on an invariance principle with respect to a resolution change of measurements and with 
respect to the principle of no-faster-than-light signalling. 
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I. INTRODUCTION 

The fundamental probabilistic postulate which allows 
QM its successful predictions was first introduced by 
Born (from a suggestion by Einstein) in 1926 within 
the scattering theory [l|. The question if such postu- 
late could be derived as a theorem from other postulates 
of QM was quickly posed. A first answer was provided by 
Von Neumann and included in his famous book of 1932 
on the mathematical foundations of QM 3]. His theo- 
rem nowadays does not enjoy much attention since it is 
believed to contain more assumptions than Gleason's Q ■ 
Different attempts have been discussed in the context of 
the relative-state Q and the many- worlds interpretations 
@ of QM by Finkelstein Q and Hartle H, which have 
analyzed an endless sequence of measurements to show 
that the relative frequency follows the Born rule. How- 
ever the meaning in the real world of an infinite sequence 
of measurements is controversial and if these proofs con- 
tain circularities is not clear [|j]-[l3j|. 

The 1957 Gleason work Q is regarded as the most im- 
portant demonstration of the Born rule [lij]. Yet, this 
theorem is usually considered quite formal and difficult 
to grasp. Moreover, it is based on the concept of non- 
contextuality of probability (NP), whose connection with 
physics is not clear. In the structure of the Gleason the- 
orem, every Hilbert subspace Ai corresponds to an ob- 
servable quantity. Each one of such subspaces is rep- 
resentable by a projection operator 11^ and every or- 
thonormal basis of vectors { |a^) } is related to a complete 
mutually set of commuting projectors {IF = |dj)(aj|} . 
A projector n = \ip) can represent the question with 
the answer yes-no when we make an experiment testing 
if the system has the respective state \tp). A set of com- 
muting projectors corresponds to a set of questions which 



can simultaneously be asked in a measurement. In the 
logical-algebraic approach [ll|, probability is a measure 
defined over a projection lattice (set of closed subspaces) 
of Hilbert space, a mapping p : IFvi— >[0, 1]. The postu- 
lates on which the theorem rests are: 

1. The probability assigned to a complete set of pro- 
jectors is normalized, if dirriH = N: 



N 



l , p(ili) e [0, l] 



(i) 



2. For any sequence of mutually orthogonal projec- 
tors: 



M 



M 



PE^O M<N. 



(2) 



Postulate 2 is equivalent to assuming NP: the probability 
of a certain occurrence, for example the answer yes for 
a projector does not depend on other questions simulta- 
neously tested, that is, on other projectors. In terms of 
basis vectors, the probability of obtaining a given state is 
independent of the basis it belongs to. To illustrate this 
important point, let us consider, for instance, two com- 
plete set of mutually orthogonal projectors {IF}, {IF ; } of 
a Hilbert space with diraH = 3, where IF = H^. From 
the normalization postulate of probability: 



P (n 1 )+p(n 2 ) + p(n 3 ) = 1, 

p'(Tl 1 )+p(Tl' 2 )+p(Tl' 3 ) = l, 



(3a) 



(3b) 



where in principle p(Hi) and p'(Hi) may be different. 
In the subspace orthogonal to IF we have II2 + II3 = 



2 



ri'2 + n'3, then from Postulate 2: 

p(fl 2 )+p(iL 3 )=p(fl' 2 )+p(iL' 3 ), (4) 

and by comparing Eq. ([3j) with Eq. §4$ we get p(fti) = 
p'(fti). It is also possible to show the vice versa, i.e. if 
NP holds then Postulate 2 is true. Thus Postulate 2 is 
equivalent to postulate NP. From the Postulates 1 and 2 
it follows 

Gleason Theorem: For dimH > 3, there exists an opera- 
tor density p such that the probability measure for every 
projector ft is the rule trace: 

p(fl) = tr(pfL) , (5) 

namely, the Born rule. 

More recently, new proofs appeared in the literature 
which are "more physically motivated than Gleason's ar- 
gument" to demonstrate the Born rule. Particularly in- 
teresting are Deutsch's and Zurek's derivations (l6|-fl8j. 
also for their further motivation to demonstrate the in- 
herently probabilistic nature of QM and how the meaning 
of probability emerges from the theory. Such an ambi- 
tious goal, however, has been disputed and some authors 
suggest the presence of circularity flaws. According to 
^ is not possible to derive the Born rule with- 
out introducing from the very beginning some assump- 
tions about the existence of probability into the theory. 

Deutsch's and Zurek's demonstrations of the Born rule 
have similar structures. As a first step, they use an 
invariance principle in order to show that equal ampli- 
tudes correspond to equal probabilities. Deutsch finds 
such principle in the theory of decisions (for critical dis- 
cussions and further elaborations see [23j]-[25j]). Zurek 
introduces an environment-assisted invariance principle 
[26l [27j , envariance, in the framework of a relative-state 
approach where the system S under measurement jointly 
evolves with the environment £ and, possibly, with one 
or more auxiliary measurement devices. After demon- 
strating the correspondence between equal amplitudes 
and equal probabilities, both Zurek and Deutsch con- 
sider a fine-graining technique to deal with the general 
case of different amplitudes of the initial state. For this 
they have to introduce auxiliary systems which become 
entangled with S. 

However fine-graining has been questioned: Caves [l9[ 
considers disturbing the necessity of introducing addi- 
tional systems of adequate dimensions, possibly in an in- 
finite Hilbert space, in order to reach the wanted approx- 
imation to irrational probabilities. Barnum 22] points 
out an even more stringent weakness: the step from ratio- 
nal to irrational amplitudes requires the continuity of the 
probability with respect to such amplitudes and such a 
property is not demonstrated by Zurek and Deutsch. We 
note that a fundamental and extensive part of Gleason's 
theorem is actually devoted to prove that the probability 
is continuous. 

In the next section we intend to demonstrate the Born 
rule from the NP assumption. In our approach we do not 



enter into the controversy about the origin and the mean- 
ing of probability, and from the very beginning, following 
Gleason (see also Mohrhoff in Ref. |2(|), we assume the 
existence of probability as a measure definite on a quan- 
tum state. In addition, we do not suppose unitary evolu- 
tion and consider a single system, therefore our theorem 
is very close to Gleason's original formulation. However, 
differently from Gleason, we consider as a postulate the 
state of a system described by a vector in the Hilbert 
space, which allows a much more elementary deduction 
of the Born rule. As Gleason's theorem, our result holds 
for a dimension of the Hilbert space N > 3, where N 
could be finite. We also give a rather simple demonstra- 
tion that the probability is continuous from NP. Without 
this proof, which takes up a conspicuous part of Gleason's 
work, we could state that Deutch's and Zurek's results as 
well as ours, hold only for rational values of amplitudes. 

In Section III we generalize our theorem to include 
entangled states and multi-particles systems. Since the 
Hilbert space H of such systems has dimension which 
is the product of the dimensions of its components, the 
coupling of a subsystem with dirriH > 3 with another 
subsystem with dirriH = 2 enables us to extend the Born 
rule also to such subspaces. 

In Section IV we show that the Gleason postulate of 
NP is identifiable in the use of fine-graining and that 
such a postulate is also a part of Zurek's and Deutsch's 
proofs. Indeed, fine-graining is a resolution change of 
our measurements and NP is equivalent to a condition of 
invariance with respect to such a change: fine-graining 
on a subspace of the state must not alter the probability 
of finding the system in a different subspace. This res- 
olution invariance is an intuitive and physically natural 
interpretation of NP. 

Finally, in Section V, we show that the condition of 
noncontextual probability, or resolution invariance, can 
be also deduced by a hypothesis of no-faster-than-light 
signalling. 



II. BORN RULE AS A THEOREM FROM 
NONCONTEXTUAL PROBABILITY 

Contrarily to the principles of classical mechanics, the 
axioms of QM do not have an equally shared and clear 
formulation. We do not want to enter into such a thorny 
issue, related to the interpretative problems of QM. For 
our purpose, from the usually regarded quantum postu- 
lates in standard presentations of QM (see, e.g., [28[ - 
[3ll| ) . we can extract the following common assumptions: 

Postulate I: At the time t the state of every physical 
system S is described by a normalized vector \tpg(t)) be- 
longing to a Hilbert space H. 

Postulate II: Every measurable physical quantity A is 
described by a Hermitian operator A acting in %. 

Postulate III: The only possible result of a measurement 
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of an observable A is one of the eigenvalues {a^} of A, 
(as the customary habit, from now on we will identify 
the observable with its operator). 

Postulate I is known as the Completeness Postulate of 
QM. Postulate II links an observable quantity with an 
operator in a Hilbert space (observable-operator link). 
Postulate III connects a particular value of that observ- 
able with an eigenvalue of the corresponding operator 
(value-eigenvalue link). 

Now consider an observable A with discrete eigenval- 
ues {aj} and eigenstates {|<2j)}, i= 1,...,N, and a single 
system S in the state \ips) € H N . {\a,i}} is a complete 
orthonormal set and \ips) can be decomposed into com- 
ponents: 

JV 

b/> s ) = 5>k). ( 6 ) 

i=l 

We intend to prove the following 

Theorem 1: If NP and the postulates of QM hold (except 
for the Born rule and the unitary dynamics), then for 
N > 3 the probability of obtaining the non-degenerate 
eigenvalue at, measuring A on the state \ips)> is neces- 
sarily given by the square modulus of the inner product: 

p(o,0 = |(a#s(i))| 2 Born Rule, (7) 

where |<Zj) is the eigenstate associated with the eigenvalue 

We assume the probability as a primitive concept and we 
begin the demonstration of Theorem 1 by proving some 
simple lemmas. We underline that any theorem having 
the aim to prove the Born rule has to face the fact that 
the connection between probability and quantum state 
must be postulated or deduced by other means. Gleason, 
for instance, postulates the probability to be a measure 
definite on states of the basis. 

As Gleason, we suppose QM to be a probabilistic the- 
ory, which makes predictions about probabilities of oc- 
currences of observable quantities, and that probabilities 
are functions of quantum states. Then, from QM postu- 
lates we want to show that 

Lemma T. The probability of obtaining an eigenvalue 
dj is equal to the probability of obtaining the respective 
eigenstate |a») (statistical case): 

pK)=Kk»- (8) 

Proof: Suppose the system is prepared with the eigen- 
value a.;. In this case p(ai) — 1. According to the postu- 
late of QM (eigenvalue-eigenstate link): 

Postulate IV: If the measurement of the observable A 
on the system in the state \ifjs(t)) gives the result Oj, the 



state of the system immediately after the measurement is 
the associated normalized eigenstate |ai) (non-degenerate 
case). 

Hence if p(<2j) = 1 then p(\a,i}) = 1, and vice versa. 
But {di} correspond to mutually exclusive events, thus 
has to be piflj) = for j =/= i. Then must neces- 
sarily be p(\a.j)) — too, since, otherwise, having ob- 
tained with a measurement the state \a,j) we should as- 
sign to the system the eigenvalue a,j, in contradiction 
with the exclusivity property of {a^} events. Therefore 
we have the equation between conditional probabilities 
(non-statistical case): 

p(\oi);aj) =p(ai\ |o,-}) = . (9) 

Because of Postulate III of QM, the only possible result 
of a measurement is one of the eigenvalues of A. Then 
from the Bayesian formula of the joint probability, for a 
generic preparation \ips) °f the initial state, we have 

N 

p(\ a i)) = J2 p ^ ai ^ a ^ 

(10) 

i=i 

where we have used Eq. ^ in the conditional probabil- 
ities of Eq. (JTD). □ 

We note from Eqs. (|9lT0| that 

N 

p(\a i );\a j ))=S ij , £>(K» = 1- (U) 

i=l 

Lemma 1 can be easily generalized for entangled states 
as well. For instance, consider the state: 

N M 

t=l fe=l 

where B is a new observable commuting with A, with 
eigenvalues and eigenstates k — 1,...,M, 

and \i/>sb) € H N ® H M , (we point out that B does not 
necessarily have to belong to an other system: for ex- 
ample, A could be the spin observable and B the posi- 
tion observable of one particle). Consider the pairs of 
eigenvalues {cn,bk}, and eigenstates {|aj), ]&&)}. With 
the trick of the conditional probabilities, it follows that 
on the generic state \i/jsb)' 

Kk)l& fc » =p(\oi),\h)) 

(13) 

= p{{a l ,b k }) 1 

(see appendix lAlfor a proof of this). Then the probability 
p(a,i) is again given by: 
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A I 



A I 



P( a i) = ^2p{i a uh}) = ^y>(\ai)\h)) 



k=l 
M 



k=l 



(14) 



5>({|oi>.IM})=P(|oi»- 



k=l 



With a similar reasoning we can also show that p(bk) — 
p(\bk))- We notice from Eqs. (|13I14|) that detections 
of {|di)|6fc)} are mutually exclusive events as well, with 
the condition of normalization which follows from the 
normalization of p(a,i). 

Lemma 2: If a coefficient of the development in Eq. © 
is zero, the probability of obtaining the associated eigen- 
value and eigenstate is zero as well. 

Proof: If, for instance, Ck = 0, the corresponding state 
|ojfe) will be orthogonal to \ips)- We can think {\ips), \ a k)} 
as eigenstates of the projector II5 = \ips)(ips\ with eigen- 
values {1,0}. The Hermitian operator II5 represents the 
observable testing whether the system S is prepared with 
the state \ips) or not. The pair {\tps), \ a k}} is a complete 
orthonormal basis of a subspace W 2 of H N ■ 

With the system prepared in the state |^\g): 
p(\ips)', l^s}) = !• From Lemma 1, it follows that 
p(l; \ips)) — 1 and because eigenvalues are exclusive 
events p(0; \ips)) — 0. So, if we apply Lemma 1 again 
to the previous equation, considering that |afc) is eigen- 
state of both lis and A, we have p(\ak); \ips)) = and 
p(a k ;\ip s )) =0. □ 

Also Lemma 2 can be directly generalized to entangled 
states. For instance, if in the state |V\sb) the coefficient 
Chi = then p(\ak)\bi); |V>sb)) = 0. In particular, if the 
state of the system is in a Schmidt decomposition: 



N 



(15) 



Proof : Let us recall the meaning of NP with an example. 
Consider two operators A and A' which do not commute, 



A, A' 



7^ 0. We take a Hilbert space with dirnH = 3. 



Eigenvalues and eigenstates of A and A' are, respectively: 



A: {ai, a 2 ,a 3 }, {|ai), |a 2 ), |a 3 )} 
A'; {a u a' 2 ,a' 3 }, {K), \a' 2 ), \a' 3 )} 



(18) 



where they have in common the eigenvalue a\ and the 
associated eigenstate (we see that this is possible, if A 
and A' must not commute, just for N > 3). We can write 
the state \ips) in the two different bases of Eq. (fl"8|) : 



IV's) = ciK) + c 2 |a 2 ) + c 3 |a 3 ) 
= d\ax) +c 2 |a 2 ) +c' 3 \a' 3 ) . 



(19) 



In general, the probability of obtaining the eigenvalue a\ 
could depend on whether we measure the observable A or 
A' . Probability could be different if we arc in the exper- 
imental context which is prepared for measuring which 
value of the triplet {eii, a 2 , a 3 } A will have, or which value 
of the triplet {01, a' 2 , a' 3 } A' will have. 

With the result of Lemma 1 and the completeness pos- 
tulate of QM, we can write the more general probability 
in different experimental contexts as: 



PA(ai) = p(\ai); \ip s ), {M, |a 2 ), |a 3 )}) , 
PA'(ai) = p(\ai); \ip s ), {|ai), \a' 2 ), \a' 3 )}) . 



(20) 



The probability is noncontextual if p{a\) does not depend 
on the measurement with other eigenvalues: 



PA(ai) = PA'(ai) , 



(21) 



or, according to Lemma 1, the probability p(\ai}) does 
not depend on the measurement with other eigenstates: 



from Lemma 1 and Lemma 2, since p(\ai)\bk)) = for 
k 7^ i, follows the perfect correlation property: 



A I 



P( a i) = ^2p({^i bk}) = p(\a,i)\bi)) = p(\a t )) 
= p(\b i ))=p(b i ). 



k=l 



(16) 



Lemma 3: If the probability is noncontextual the proba- 
bility of obtaining the eigenvalue can depend only on 
the modulus of the coefficient c, : 



Pa(M) =PA>(\ai}) ■ 



(22) 



Then, by assuming NP, Eqs. (|20|) are reduced to the 
equation: 



(23) 



stating that the probability only depends on the eigen- 
state |ai) and, from the completeness postulate of QM 
and NP, on the state of the system \ips)- 

Thus, let us choose a new basis {|a^)} such that the 
state in Eq. ((5]) is written with only two components: 



P{ai) =p(\ci\) 



(17) 



\<ips) = ci\ai) + |c 2 ||a 2 ) 



(24) 



5 



where c[ — for i > 3, |c' 2 | is the modulus of the ampli- 



tude Co and 



\a[) ee | ffll ) 



■E 

i=2 



f-TT Vc t |a, ; ) 



(25) 



{|oi), |a' 2 )} could be regarded as eigenstates of an observ- 
able A' = |ai)(ai| with eigenvalues {1,0} checking if our 
system is in the state |ai) or not. 

According to NP the transformation of basis — > 

{\a'i)} of the state cannot change the probability: 



p(K); \tps)) = P(\ai); |-0S»{|a»)} 
= p(K); l^s)){| a ;)} 



(26) 



Now we expressly write the phase of the coefficient c\ in 
the state of Eq. 



(27) 



|Vs>=e^|c 1 ||a 1 ) + |4||4). 
We get for the state \a{) the probability: 

p(| 0l ); |Vs)) = p(M; e^\ Cl \\a x ) + \c' 2 \\a' 2 )) . (28) 



J 12 
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Therefore, considering the dependence |ci| 2 
and that the components of the orthonormal basis are 
the constants {a'^a'j) = the most general function of 
the probability in Eq. (|28|) can be written as: 



p(|oi);|^ s »=p(^i,|ci|). (29) 
A similar argument when applied to every state |aj) gives 

p(\ ai );\^s))=p(ct>iAci\). (30) 

However, by using the conservation of probability we can 
show the independence from the phase of p(|a.j); \ips))- 
Indeed, let us suppose to have two different states \ips) 
and \ipg) which are distinct just with respect to the phase 
of the component \ai) (to simplify the notation, we se- 
lect a basis such that the components of the states are 
different from zero only in a subspace with dirriH = 2): 



1^5 



ci ai)+e*» c 2 |a 2 ) , 



(31) 



ci ai) -I- e <p2 |c2||a 2 ) , 



where with (f>* we have denoted the new phase. The 
conservation of the probability requires: 



p(M;|Vs))+p(|a 2 );|Vs)) =1. 
KM;|^))+p(|a 2 );|^>) = 1- 



(32) 



where from Lemma 2 we have used the fact that 
p(\a>i);\ips)) = if |ai)_L|^ s >- Thanks to Eq. §U\), we 
have that 



p(\a 2 );\i> s ))=p(\a 2 );\r s )) 



(33) 



From this and Eqs. (|32|) . we also get: 

p(|a 1 );|^))=p(|a 1 );|#)) 1 (34) 

which is equivalent to write: 

p(0Mci|) = P (0i,| Cl |). (35) 

Therefore, since ^ is an arbitrary phase, the probability 
p(a\) does not depend on it but can only depend on the 
modulus |ci|. So, we can set p{a\) — p(|ci|), and this 
proves our lemma (the generalization to probabilities of 
other eigenvalues a, is immediate). □ 

Now we have all the ingredients to prove our Theorem 
1: the probability p{a,i) is given by the square modulus 
of coefficient Cj. 

Proof : Let us consider the state \ips) G H N Eq. © with 
N > 3. For the sake of simplicity we focus our attention 
again on p(a%). We choose two bases {|di)} and {|a^)} 
such that 



ci\ai) + c 2 |a 2 ) + c 3 |a 3 ) 



ci|ai) + c' 2 |a 2 ) ! 



(36a) 
(36b) 



where Cj = for i > 4, c\ = for i > 3 and |aj) = \a'i)- 
From the conservation of the total probability for the 
states of the two different bases we get respectively: 

p(|ai))+p(|a 2 ))+p(|a 3 )) = l, (37a) 

p(|oi))+p(|4)) = l, (37b) 

where for NP we have the same probability p(\ai}) in 
Eqs. (|37p . With a change of variable we can write Eq. 
(IT71) as p(ai) = f(\c t \ 2 ). From Eqs. (|37J Jt follows that 



/(|ci| 2 ) + /(M 2 ) + /(|c 3 | 2 ) =1 



/(|ci| 2 ) + /(|c 2 | 2 ) =1 



(38) 



By comparing the two Eqs. (|38|) we get: 

/(|c 2 | 2 ) = /(| C2 | 2 ) + /(| C3 | 2 ). (39) 
Since the state \ips) is normalized, we also have 

|ci| 2 + |c 2 | 2 + |c 3 | 2 = l, (40a) 



M 2 + I4l 2 = i 



and 



J 1 2 



(40b) 



(41) 



|Oa| - |C 2 | + |C 3 | . 

Putting this in Eq. ([39]): 

/(|c 2 | 2 + | C3 | 2 ) = /(| C2 | 2 ) + /(|c 3 | 2 ). (42) 
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Now we recall that a function f(x) is linear with respect 
to the variable x if and only if the two following properties 
are jointly satisfied: 

f(xi + x 2 ) = f(xt) + f(x 2 ) additivity , 

(43) 

f(ax) = af(x) homogeneity, 

where a is any real number. We remark that if a is a 
rational number, it can be shown that the homogeneity 
follows from the additivity. Furthermore, since rational 
numbers form a subset dense in the set of real numbers, 
such demonstration can be extended to the case of irra- 
tional a, if / is a continuous function (3^ . 

Therefore, the condition of additivity is sufficient to es- 
tablish the linearity of the function / if this is continuous. 
On the contrary, if / is not assumed to be continuous, the 
condition of additivity implies linearity only for rational 
values of the variable x. 

Hence, if we assume that the probability is continuous 
with respect to the coefficients |ci| 2 , Eq. ([4*21 is a con- 
straint of linearity for the probability. In this case we can 
write: 

p(ai) = fc| Cl | 2 , i = 2,3, (44) 

where k denotes a constant. From Lemma 1 we have 
that p(|ci| 2 = 1) = 1 and therefore k = 1 and p(a%) = 
|ci| 2 . Hence, the Born rule is deduced for continuous 
probabilities. 

We point out that we have postulated the continuity of 
probability. However, a demonstration of such a property 
from noncontextuality is present in Gleason's work and 
it occupies a prominent part of that paper. Therefore in 
comparison to Gleason we have only accomplished half 
the job: as Bertrand Russell once said, postulating is 
equivalent to theft on the honest fatigue. On the other 
hand, if the continuity of probability is neither postulated 
nor demonstrated, we can only say that the Born rule 
applies only for rational values of the coefficients |cj| 2 . 

Thus we now intend to complete our demonstration by 
showing from NP that the probability is continuous and 
that the Born rule also holds for irrational coefficients. 

Consider, once again, p(\ai)). We select a basis {|a-)} 
such that the state \tps) is written with two components 
as in Eq. (|36b[) . with \a[) = |ai), and the conservation 
of probability is given by Eq. (|37b[) . 

Then we choose a new basis {|a")} in general with 
\a") ^ \ai), c'l = for i > 4, but with |a 2 ') = \cQ so that 
we can write 

|^s) = ciX) + 4K) + 4X>. (45) 

The conservation of probability for the new states of the 
basis requires that 

KK))+KI«2))+KK)) = i- (46) 

From the previous condition we have the inequality: 

KK»+p(K»<i. (47) 



By comparing Eq. (|37b[) with Eq. (1471) and since, thanks 
to NP, the probability p(\a' 2 )) is the same in these equa- 
tions, we get 

P(K»< (48) 

Finally, we choose a third basis {|a"')} such that c!" = 
for i > 4, this time with \a'{') = |ai): 

\*Ps)=c 1 \a 1 )+4'K')+4'\4'}- (49) 
The conservation of probability gives us: 

p(|oi))+p(K))+p(K)) = i. (so) 

From Eq. (|50[) we get the inequality: 

Kh»<i-KK"»- ( 51 ) 

Therefore the probability p(\ai}) is in the range: 

KK»<p(|ai»<l-p(K'))- (52) 

Suppose | ci | 2 to be an irrational number, we choose ra- 
tional values for \c'{\ 2 and |c 2 "| 2 so that p(K)) = l c i'| 2 
and p(K'» - |4T- By putting p(| 0l )) = p(|d| 2 ), Eq 
(1521 becomes 

Kf <p(| C i| 2 )<i-|c' 2 T, |4'iM4TeQ, (53) 

with |c'/| 2 < 1 - IcJ,"! 2 . Let us denote with \i/j s ±) the 
orthonormal state to \ips)- Consider the limits \a") —> 
\ ai ), \a'i) \iPs±) and |a' 2 ") ^ \a> 2 ), \a%') \^s±)- We 
have K| 2 -^J Cl | 2 , \4\ 2 and K'f |c 2 J 2 ; |c^"| 2 -> 
0. Summarizing, with the normalization condition in Eq. 
(|40b|) we get: 

|c'/| 2 ^| Cl | 2 , l-|c 2 "| 2 ^| Cl | 2 . (54) 

Because rational numbers form a dense set in the set of 
real numbers, we can always find a couple of numbers 
|c"| 2 , 1 — |c 2 '| 2 € Q which are as near as we want to 
the irrational number |ci| 2 . Therefore, from Eq. (|53")) it 
follows that the oscillation of the function p is zero at 
|ci| 2 . Consequently, the probability must be continuous 
at |ci| 2 andp(|ci| 2 ) = |ci| 2 even if | Ci | 2 £ Q. □ 



III. NONCONTEXTUAL PROBABILITY FOR 
ENTANGLED STATES 

Theorem 1 with NP can be generalized to include also 
entangled states, bipartite or multipartite systems and 
quantum measurement devices. 

Let us indicate with C the context, including our ex- 
perimental devices and the environment, with \d) £ H M 
its possible states. Consider again the measurement of 
the two observables A and A' introduced in the proof of 
Lemma 3 in the previous section. The choice of measur- 
ing either the eigenvalues {a 2 ,as} or {0^,03} together 



7 



with a\ corresponds to two different experimental ar- 
rangements A and A'. 

Suppose that at time to the system is in the initial state 
|^Sc(*o)) = IV-^ICo)- At time t, after the interaction 
with the measurement device, the entanglement between 
the system S with the experimental setup C correlates 
the observable A (or A') of the system with the pointers 
of the experimental device. If we measure A we have the 
state: 

\^sc(t)) =c 1 \a 1 )\C 1 )+c 2 \a 2 )\C 2 )+c 3 \a 3 )\C 3 ), (55) 
while if we measure A' we get: 
\1>sc>(t))= c 1 \a 1 )\C[)+ 414)1^) +c' 3 \a' 3 }\C 3 ). (56) 



The global wave functions Eqs. (|55fl56[) are different in 
the different contexts A and A'. So the two probabilities 



PA ( ai ) =p(\a 1 )\C 1 );\i>so(t))), 
PA ,{ ai ) =p(\ai)\C' 1 )-AiPsc(t))), 



(57) 



may be different. The possible contextuality of probabil- 
ity is now included inside the description of the global 
system S + C of the quantum state. We notice that it is 
no longer possible to write 

PA ( ai ) = p(|oi)|Ci); IVscW), {M, |02>, |os)}) , (58) 

if we accept the completeness of QM. Otherwise, indeed, 
some parameters would be outside the description of the 
state \i/>sc(t)), which now englobes not only the system 
but also the context of the measurement. This should 
imply the existence of hidden variables. 

Assuming NP requires the equality between the prob- 
abilities in Eq. ([57)) : 



p(|ai>|Ci>; \i>sc(t))) = p(\ai)\Ci); , (59) 



which generalizes the condition in Eq. (j2"2")l . In fact, 
Eq. (l59l) by the property of perfect correlation becomes 
equivalent to Eq. (|2"2"1) . 

In a similar way as in the previous demonstration, by 
using the state of Eq. (|5"5"|) and a state \ips) with a new 
basis {|ai)} such that c' i = Q for i > 3, \a[) = \a%) we can 
write: 



\i>sc'(t)} = c 1 \a 1 }\C' 1 } + |4II4)I^) 



(60) 



From the condition of NP it is not difficult to deduce that 
p(\ai)\Ci);\ipsc(t))) =p(\(k\) must hold. Then, employ- 
ing states like in Eq. ([5l5"f and Eq. ([BTJf and from NP, 
normalization condition and conservation of probability, 
it is possible to prove again the linearity of the probabil- 
ities with respect to the square moduli of the amplitudes 
(continuity can also be generalized to the present case). 
Putting all pieces together, we get 



Kk>|Q);|^c(*)» 



= \{ ai \{CA\Mt))\ 2 
= \(a^s)\ 2 . 



(61) 



We note that the state of the global system S + C is in 
the Hilbert space T-L N (g> T-L M with dimension N ■ M . If 
M > 3 this allows us to include in the demonstration of 
the Born rule also states with N = 2. In fact, consider a 
system S described by a state \ips) in a Hilbert space with 
dirnH = 2 and basis {|ai), l^a)}- With a measurement 
we could use the context C as an ancilla system to build 
the following entangled states: 



'sci 



■ ciMId) + c 2 |a 2 )|C 2 ) 



(62) 

|^c0 = oi\ ai )\Ci) + c' 2 \a 2 )\C! 2 ) + c' 3 \a 2 )\C> 3 ) , 

where \ipsc) ^ s obtained by dividing the part of the state 
\ips) associated with the eigenvalue a 2 into two regions. 
In a similar way it is possible to demonstrate that p(a>i) = 
K^ilV's)! 2 as hi the previous cases. 



IV. NONCONTEXTUAL PROBABILITY AND 
RESOLUTION OF THE MEASUREMENT 

A simple application of Lemma 3 shows that equal am- 
plitudes are related to equal probabilities, i.e.: if c, = Cj 
with i ^ j then p(cii) = p{flj). Whereas Lemma 3 is de- 
rived from the NP principle, the same result was obtained 
by Zurek and Deutsch from two different invariance prin- 
ciples. From here both authors study the more general 
case with states having different rational amplitudes us- 
ing the fine-graining technique. We shortly summarize 
their results. We consider the fine-graining in Zurek's 
approach with the environment in a simple situation (our 
remarks also hold for Deutsch's similar derivation). We 
assume that the state of the system S is in a Hilbert 
space with dimH = 2, entangled with the the state of 
environment £ such that their joint state is 



\i>ss) = ~7^l ai )l £l ) 



N - 1, 



N 



(63) 



The trick is to transform this state with unequal coeffi- 
cients into a state with equal coefficients by fine-graining. 
This goal is reached by extracting from the environment 
an ancilla system C having states 



N 



\C{) = |C X ) 



\c 2 ) = J2\c1)/Vn^T. 



(64) 



which become correlated with the states {\ai), \a 2 )} of 
S. Within a quantum measurement, C can be regarded 
as a counter and {|Cj)} as a new orthonormal basis of 
pointer states. Denoting with {|ej)} the new states of the 
environment without C, after the interaction between S 
and C we have the joint system 



1 



iVtecs) - -^=|ai)|Ci)|ei) + 



N-l 

N 



a 2 )\C 2 )\e 2 ), (65) 
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which can be expressed with the new basis of pointer 
states of Eq. 

tysee) = 4f l«i>|Ci>|ei> + 4= E l«2>l^>je 2 ) . (66) 

Now we have a new state with equal amplitudes. But 
how can we be sure that after fine-graining the proba- 
bility p(|oi)) with the state \ipscs) is the same with the 
state \ipsce)7 In principle they could be different, so to 
legitimately use the fine-graining reasoning we have to 
assume the equality: 

p(K>; \i>sce)) = p(\ai); \ipsce)) , (67) 

equivalent to the condition of NP encountered in Eq. 
(|59p of section 3. Then, in their demonstrations Zurek 
and Deutsch implicitly assume NP. We notice that fine- 
graining is equivalent to a change of measurement reso- 
lution, and this has been used in our derivation as well. 
A change of resolution, for instance, is considered going 
from Eq. (137a) to Eq. ([37b]) . There we assumed from 
NP that p(\ai)) is the same in the two equations and 
therefore: 

p(\a' 2 ))=p(\a 2 ))+p(\a 3 )), (68) 

where p{\a' 2 )) is the probability of finding the state of 
the system in the subspace orthonormal to |ai), whereas 
p(\a 2 )) and p(|<Z3)) are the probabilities of finding it, re- 
spectively, in the subspaces \a 2 ) and \a 3 ) of the subspace 
orthonormal to \a\). In fact, in terms of projectors, Eq. 
(f68|) can be written as Gleason's Postulate 2: 

p(\a 2 )(a 2 \ + \a 3 )(a 3 \)=p(\a 2 )(a 2 \)+p{\a 3 )(a 3 \) . (69) 

Thus NP could be physically interpreted as a probability 
invariance when we change the resolution of the measure- 
ment connected with other properties of the system. This 
interpretation gives physical insight to the NP principle 
as an invariance for the resolution change. 

V. NONCONTEXTUAL PROBABILITY AND 
NO-FASTER-THAN-LIGHT SIGNALLING 

In general, every measurement in the real world hap- 
pens in the spacetime. Every measurement of any ob- 
servable is necessarily reduced to a measurement of po- 
sition at a certain time. For instance, the Stern-Gerlach 
apparatus correlates a spin observable with a position 
observable so that different values of the spin univocally 
correspond to different position values of the system. A 
measurement of position detects the spin value of the 
system. 

Let us introduce in our representation of S the ob- 
servable position X. We assume that at time to, the 
system is in the initial state \ipsc(to)) = \4's)\ x o)\Co) . 
For simplicity of notation, from now on we denote with 



\C X i) = \xi)\Ci) the position state of the system with 
its context, where the letter x will remind us the depen- 
dence on the position of the system and the experimental 
apparatus. 

We measure the observable A or A'. If we measure A 
we get: 

\^sc(t)) = c l \a l )\C xl )+c 2 \a 2 )\C x2 )+c 3 \a 3 )\C x3 ), (70) 
while if we measure A' we have the different state: 

\1>sc>(t)) =c 1 |a 1 >|Ci 1 >+4|4>|C4 2 )+4l4)|Ci 3 >. (71) 
We want to prove the following 

Theorem 2 : If we assume the no-faster-than-light 
signalling condition for measurement events which are 
spacelike separate, then the probability p{at) must be 
noncontextual. 

Proof. If the measurements of {a 2 , a 3 } or {a' 2l a' 3 } at the 
points x 2 and x 3 are events spacelike separate with re- 
spect to the measurement of a± at x±, according to the 
relativistic hypothesis of no-signalling, the probability 
p(a±) of getting the eigenvalue ai has to be the same 
in both different experimental contexts A and A'. Oth- 
erwise an observer situated in the place of measurement 
of {02,03} and {a 2 i a 3} would be able to send a super- 
luminal signal to another observer situated in the place 
where a\ is measured. Then, according to the no-faster- 
than-light signalling hypothesis: 

p(|a 1 )|C 3! i>;|^sc(*)»=p(|ai>|C , xi>;IV'sc'(*))). (72) 
which is to say, with the result of Eq. (Ti~6"| : 

p{ai) =p(\ai);\if>sc{t))) 

(73) 

= p(\a 1 );Wsc>(t)))- 

Therefore p(a±) must be noncontextual. □ 

Consider, beside the measurement of the eigenvalues 
{ai, a 2 , a 3 } of the observable A, also the simple possibil- 
ity of measuring whether our system has the eigenvalue 
a± or not, which is equivalent to check whether the system 
S is in the state \ai) or in the subspace |fflix) orthogonal 
to it. In such a case, during the process of measurement, 
the system S + C will be in the state: 

\i>sx x {t)) = cx\ai)\C xl )+c 21 _\a 11 _)\C x2X ). (74) 

If the two choices of measuring either {|ai), \ai), \a 3 }} or 
{|ai), |<2i_i_)} correspond to events spacelike separate, as 
mentioned above, from the condition of no-signalling we 
must have the NP condition: 

p{a l ) =p(\ai);\ip S c(t)}) 

(75) 

= p(\a 1 );\i>scAt)))- 

From Eq. (|75"|) the conservation of probability imposes 
also the equation: 

p(\ai±); tyscAt))) = 

(76) 

= p(|o 2 >; \^sc(t))) +p(\a 3 ); \Tpsc(t)}) ■ 
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How we showed at the end of Section IV, Eq. ([76]) is 
equivalent to the initial hypothesis adopted by Gleason. 
Moreover, by examining Eq. (f75|) we understand that it 
is analogous to assuming an invariance condition of the 
probability under a resolution change of our measure- 
ment, or fine/coarse graining. So, at least for spacelike 
separate measurements, NP could be deduced by a rela- 
tivistic principle. However, the Born rule holds for gen- 
eral events, not only for spacelike separate ones. Hence 
NP or, equivalently, the invariance of probability when 
we change resolution, seem to be more general principles. 

VI. CONCLUDING REMARKS 

We have given a new demonstration of the Born rule 
based on noncontextual probability and on some non- 
statistical postulates of QM. The introduction of a pure 
quantum state of the system has allowed a more elemen- 
tary proof than Gleason's theorem. This may seem, at 
first sight, a loss of generality with respect to Gleason's 
proof, however Gleason also introduces vectors of basis 
which can be regarded as states of the system. More- 
over, the step from pure states to density operators and 
the trace rule for a mixed ensemble is quite natural when 
we have the Born rule in hand [33j . 

As in Gleason's, our demonstration holds for dimH > 
3, where dimH. could be finite or infinite. The origin 
of this dimensional limitation lies in the fact that for 
dimH > 3 the same state vector may belong to differ- 
ent bases. This, with NP and the normalization of the 
probability and the quantum state, brings us to a con- 
straint of functional linearity between the probability and 
the amplitude square modulus. Our theorem can be sim- 
ply generalized in order to include multipartite states. In 
such a case, if the state is dimH > 6, the Born rule can be 
demonstrated to hold also in subspaces with dimH. = 2. 

We have also given a demonstration that the probabil- 
ity is continuous on the amplitudes using NP. 

Physical motivation for NP could come from a rela- 
tivistic principle of no-faster-than-light signalling. How- 
ever, the Born rule also holds for measurements which 
are non-spacelike separate, hence NP seems to be a more 
fundamental principle. Finally, we have given an inter- 
pretation of NP as an invariance of the probability under 



a resolution change of our measurement. 



Appendix A 

We want to generalize the result of Lemma 1 for en- 
tangled states. Suppose that the system is prepared 
with eigenvalues {cii,bk} and therefore with eigenstates 
{l&i); |frfc)}- Similarly to the unentangled case, we 
can deduce that p({dj,bi}) — SijSki if and only if 
SijSki- This relation, translated into con- 
ditional probabilities, is equivalent to write: 

p({<H,h};{\aj),\h)}) = 

(Al) 

P({\ a i), \ b i)}- {a tl b k }) = SijSki . 

Then, from Eq. (|A1|) we have that for a generic state as 
in Eq. (TTJ): 

JV M 

P{{ai,h}) = ^^^({a^bfel^laj), \bi)}) 
j=i i=i 

N M (A2) 

= EE m i {K>< i & »)})p({i°i>. 

= p({\a i ),\b k )}). 

If the observable A has an eigenstate |a») and the observ- 
able B has an eigenstate \bk) then the tensorial product 
|ai)|6fc) is an eigenstate of the observable A®B, and vice 
versa. The eigenstates {|ai)|fefc)} of A®B form a basis of 
Hilbert space T4 N <g> T-L M . If we indicate with p(\aj)\bi)) 
the probability of obtaining the eigenstate |aj)|6/), with 
a measurement of A eg) B, we have p({\cij), \h)}) = SijSki 
if and only if p(\aj)\bi)) = SijSki, or, in equivalence: 

K{k),|& fc )};hW) = 

(A3) 

p(\ a j)\h);{\ai), \ b k)}) = SijSki . 

From Eqs. (|A2|) - (|A3|) . through the use of the conditional 
probabilities, it immediately follows that on the state 
\iPsb) we have p(|a,)|6 fc )) = p({cn,b k }). 
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